‘Tue AsrmopnysicAL JOURNAL SUPPLEMENT SERIES, 269:3 (25pp), 2023 November 


OPEN ACCESS. 


Siena Galaxy Atlas 2020 


John Moustakas'@, Dustin Lang* 6, Arjun Dey", Stéphanie Juneau? G, Aaron Meisner'G, Adam D. Myers 
Edward F. Schlafly’ ©, David J, Schlegel”, Francisco Valdes? C, Benjamin A. Weaver’, and Rongpu Zhou" ® 
"Department of Physics and Astronomy, Siena College, 515 Loudon Road, Loudonville, NY 12110, USA: jmoustakas@siena.edu 
2 Perimeter Institute for Theoretical Physics. 31 Caroline Steet North, Waterloo, ON N2L 2¥5, Canada 
* NSF's NOIRLab, 950 N. Cherry Avenue, Tucson, AZ 85719, USA 
* Department of Physics and Astronomy, University of Wyoming, Laramie, WY 82071, USA 
$ Space Telescope Science Institute, 3700 San Martin Drive, Baltimore, MD 21218, USA. 

* Lawrence Berkeley National Laboratory, | Cyclotron Road, Berkeley. CA 94720, USA 
Received 2023 July 10; revised 2023 August 23; accepted 2023 August 24; published 2023 October 18 


Abstract 


We present the 2020 version of the Siena Galaxy Atlas (SGA-2020), a multiwavelength optical and infrared 
imaging atlas of 383,620 nearby galaxies. The SGA-2020 uses optical grz imaging over 20,000 deg? from the 
Dark Energy Spectroscopic Instrument (DESI) Legacy Imaging Surveys Data Release 9 and infrared imaging in 
four bands (spanning 3.4-22 zm) from the 6 year unWISE coadds: it is more than 95% complete for galaxies larger 
than R(26) © 25" and r< 18 measured at the 26 mag arcsec ^ isophote in the r band. The atlas delivers precise 
coordinates, multiwavelength mosaics, azimuthally averaged optical surface-brightness profiles, model images and 
photometry, and additional ancillary metadata for the full sample. Coupled with existing and forthcoming optical 
spectroscopy from the DESI, the SGA-2020 will facilitate new detailed studies of the star formation and mass 
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assembly histories of nearby galaxi 
and fundamental plane relations; 
multimessenger astronomical events; and more. 


nable precise measurements of the local velocity field via the Tully-Fisher 
ve as a reference sample of lusting legacy value for time-domain and 


Unified Astronomy Thesaurus concepts: Galaxy photometry (611); Celestial objects catalogs (212) 


1. Introduction 
1.1. Scientific Context 


Although a broad theoretical framework exists for under- 
standing the physics of galaxy formation, many key questions 
remain unanswered. In this framework, the star formation and 
stellar mass assembly histories of galaxies are intimately linked. 
to the hierarchical buildup of dark matter halos, modulated by a 
time-varying interplay between internal (secular) and external 
(environmental) processes (e.g., Kennicutt 1998; Kormendy & 
Kennicutt 2004; Blanton & Moustakas 2009; Somerville & 
Davé 2015; Naab & Ostriker 2017; Wechsler & Tinker 2018). 
Identifying these physical processes and their relative 
importance—as a function of cosmic time—remains one of 
the foremost outstanding problems in observational cosmology. 

Some of the most pressing outstanding questions include the 
following: 


1. How and why does star formation in galaxies cease? Why 
do low-mass galaxies exhibit more extended star 
formation histories than massive galaxies? 

What is the nature of inside-out galaxy formation—did 

galaxies start growing earlier in their inner parts, did they 

end star formation earlier in those parts, or both? 

3. How does feedback from active galactic nuclei, super- 
novae, stellar winds, and other effects regulate star 
formation? 

4. What was the relative impact of mergers versus secular 
processes on the structure of present-day galaxies, 
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including their bulges, bars, spirals, rings, warps, shells, 
and pseudobulges? 

5. How are all these detailed processes affected by the 
relationship between galaxies and their host dark matter 
halo—for example, whether they are central or satellite 
galaxies? 


Investigating these and related questions requires an accurate 
and detailed view of the present-day galaxy population—the 
population we can potentially understand the best. In particular, 
the galaxies that are near enough or intrinsically large enough 
in terms of their apparent angular diameter to be spatially well 
resolved can be studied in significantly greater detail than more 
distant, spatially unresolved galaxies.’ For example, in typical 
ground-based optical imaging, the galaxies with isophotal 
diameters larger than ~10’-20" can be used to study the 
properties of their disk and spheroidal components as separate, 
to identify bars, rings, disk asymmetries, and 
structures; to discover and characterize their 
rightness features such as stellar streams, tidal 
tails, and outer envelopes; to unveil faint, low-mass satellites; 
and much more. 

Because of their unique and high-impact scientific potential, 
the compilations or atlases of large, nearby galaxies have a 
long, rich heritage in astronomy. In 1774, nearly 150 yr before 
observations confirmed that the Milky Way was just one of 
many "island universes,” Charles Messier published his 
Catalogue des Nébuleuses et des Amas d'Etoiles, which 
includes 40 (now-famous) galaxies among a full catalog of 
110 objects. Subsequently, building on the naked-eye 


T We differentiate here between dwarf galaxies in the Local Group whose 
individual stars can be resolved (Mateo 1998), and galaxies, which are near or 
large enough to be spatially resolved into components (e.g., bulge versus disk), 
‘but which must still be studied via their integrated light. 
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surveying effort of William Herschel and his sister Caroline 
and son John (Herschel 1786, 1864), John Louis Emil Dreyer 
spent more than two decades assembling the New General 
Catalogue of Nebulae and Clusters of Stars (NGC) and the 
Index Catalogues (IC), a sample of approximately 15,000 
Galactic and extragalactic objects whose designations are still 
in wide-spread use today (Dreyer 1888, 1912). The NGC, IC, 
and other early catalogs laid the foundation for several 
important galaxy atlases published in the second half of the 
20th century covering most of the sky. These atlases used 
photographic imaging from the Palomar Observatory Sky 
Survey in the 1950s and 1960s (Minkowski & Abell 1963; 
Reid & Djorgovski 1993) and the UK Schmidt Southern Sky 
Survey in the 1970s (Cannon 1979), and include the following: 
the Catalogue of Galaxies and of Clusters of Galaxies 
(Zwicky et al. 1968); the Uppsala General Catalog of Galaxies 
(UGC) and its Addendum (Nilson 1973, 1974); the 
Morphological Catalog of Galaxies (Vorontsov-Velyaminov 
& Krasnogorskaya 1974); the ESO/Uppsala Survey of the 
ESO (B) Atlas (ESO; Lauberts 1982); the Principal Galaxies 
Catalogue (PGC; Paturel et al. 1989); among others. 
Eventually, data from these and other catalogs were assembled 
into the indispensable Third Reference Catalog of Bright 
Galaxies (RC3; de Vaucouleurs et al. 1991; Corwin et al 
1994). The RC3 contains extensive information on 23,011 
nearby galaxies and is purportedly complete for galaxies 
With Boys, 15.5, DQS) » l^, and v < 15000kms 
(2 «5 x 10 7), although it does include a number of object 
outside these limits that are of special interest." 

The advent of wide-area (>10° deg”), multiwavelength 
imaging surveys has produced the next-generation atlases of 
large angular-diameter galaxies in the ultraviolet and infrared, 
including the the Two Micron All Sky Survey 2MASS) Large 
Galaxy Atlas (Jarrett et al. 2003), the NASA-Sloan Atlas 
(NSA; Blanton et al 2005, 2011), the Galaxy Evolution 
Explorer (GALEX) Ultraviolet Atlas of Nearby Galaxies (Gil 
de Paz et al. 2007), the Spitzer Survey of Stellar Structure in 
Galaxies (Sheth et al. 2010), the z = 0 Multiwavelength Galaxy 
Synthesis (Leroy et al. 2019), and the Wide-Field Infrared 
Survey Explorer (WISE) Extended Source Catalog of the 100 
Largest Galaxies (Jarrett et al. 2019), However, with the 
exception of the NSA, the optical sizes, shapes, and total 
magnitudes for the objects in these atlases are based on the 
photographic-plate measurements published in the RC3. The 
NSA, meanwhile, delivers new optical measurements of nearby 
(< 0.05) galaxies using +8000 deg” of ugriz imaging from 
the Sloan Digital Sky Survey (SDSS; York et al. 2000), but is 
230% incomplete at r $ 14 (Wake et al. 2017). 


1.2. The Need for a New Large-galaxy Atlas 


With the historical context in mind, several recent 
developments motivate a renewed effort to assemble a uniform 
data set of large angular-diameter galaxies. First, three new 
ground-based optical imaging surveys jointly called the Dark 


F- DOS) represents the diameter of the galaxy at the 25 mag arcsec" isophote 
in the optical and is a well-established and historically important measure of the 
“size” of a galaxy popularized by the RCS. 

7. Note that the incompleteness in the SDSS Main Survey (Strauss et al. 2002) 
impacts all other SDSS galaxy catalogs such as Simard et al 

et al. (2015). The NSA mitigates this incompleteness to some degre 
utilizing redshifts from other surveys, but a nonnegligible fraction of the 
largest, brightest galaxies in the local universe are sil missed. 
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Energy Spectroscopic Instrument (DESI) Legacy Imaging 
(hereafter, the Legacy Surveys) have delivered deep 
imaging in g, r, and z over ~20,000 deg? of the extragalactic 
(e. low Galactic extinction) sky (Dey et al. 2019; D. J. 
Schlegel et al. 2023, in preparation). These data provide 
exquisite photometric and astrometric precision and reach 
1-2 mag deeper than either SDSS or Pan-STARRS1 (Cham- 
bers et al. 2016). In addition, the development of the state-of- 
the-art image modeling code The Tractor provides a 
computationally tractable means of working with multiband, 
multipass, variable-seeing imaging, enabling dedicated studies 
of large angular-diameter galaxies (Lang et al. 2016; D. Lang 
et al, 2023, in preparation). 

Second, in 2021 May, the DESI Survey began a 5 year 
program (2021-2026) to obtain precise spectroscopic redshifts 
for an unprecedented sample of more than 40 million galaxies 
and 10 million stars over the +14,000 deg” Legacy Surveys 
footprint accessible from the 4 m Mayall Telescope at Kitt Peak 
National Observatory (KPNO; DESI Collaboration et al 
2016a, 2016b; Abareshi et al. 2022). As part of this effort, 
the DESI Bright Galaxy Survey (BGS) is obtaining spectra and 
redshifts for a statistically complete sample of >10 million 
galaxies brighter than r= 20.175 (Hahn et al. 2023). A high- 
quality photometric catalog of the largest angular-diameter 
galaxies over the DESI footprint is needed to ensure the BGS 
has high completeness and does not suffer from the 
photometric shredding and spectroscopic incompleteness of 
the SDSS at bright magnitudes (Wake et al. 2017). 

To address these and other needs, we present the 2020 
version of the Siena Galaxy Atlas (SGA), SGA-2020, an 
optical and infrared imaging atlas of nearly 4 x 10° galaxies 
approximately limited to an angular diameter of 25” at the 
26 mag arcsec ^? isophote for galaxies brighter than r= 18 over 
the 20,000 deg? footprint of the DESI Legacy Imaging Surveys 
Data Release 9 (LS/DR9; Dey et al. 2019; D. J. Schlegel et al. 
2023, in preparation; see Figure 1). The SGA-2020 delivers 
precise coordinates, multiwavelength mosaics, azimuthally 
averaged optical surface-brightness and color profiles, model 
images and photometry, and additional metadata for the full 
sample. Notably, for many of the largest (e.g.. NGC/IC/ 
Messier) galaxies in the sky, especially outside the SDSS 
imaging footprint, the SGA-2020 delivers the first reliable 
measurements of the optical positions, shapes, and sizes of 
large galaxies since the RC3 was published more than 
30 yr ago. 

By combining existing (archival) spectroscopic redshifts 
with forthcoming DESI spectroscopy, the SGA-2020 will spur 
a renewed effort to tackle several outstanding problems in 
extragalactic astrophysics, particularly the interplay between 
galaxy formation and dark matter halo assembly. It will also 
Support studies of time-domain and multimessenger astronom- 
ical events, which are often hampered by incomplete or 
heterogeneous catalogs of large, nearby galaxies, which are 
most likely to host the electromagnetic counterparts of 
gravitational wave events (Gehrels et al. 2016; Abbott et al. 
2020) and other classes of transients. Moreover, within the 
14,000 deg? DESI footprint, the SGA-2020 is ensuring high 
photometric completeness for the BGS, and is facilitating high- 
impact ancillary science through a variety of secondary 
targeting programs (DESI Collaboration et al 20234: 


T0 tps: //sgaegacysurvey-org 


‘Tue ASTROPHYSICAL JOURNAL SUPPLEMENT SERIES, 269:3 (25pp), 2023 November Moustakas et al 


Figure 1. Optical mosaics of 42 galaxies from the SGA-2 ngular diameter from the top left to the bottom right. Galaxies are chosen randomly 
from a uniform (flat) probability distribution in angular diameter, The horizontal white bar in the lower left comer of each panel represents and the mosaic cutouts 
range from 32 to 13/4, This figure illustrates the tremendous range of galaxy types, sizes, colors, and surface-brightness profiles, internal structure, and environments 
of the galaxies in the SGA, 


Myers et al. 2023). For example, the DESI Peculiar Velocity particular interest for some readers may be Section 3.3, where 
Survey will place precise new constraints on the growth rate of we validate our surface-brightness profiles and summarize the 
large-scale structure by measuring the Tully-Fisher and principal SGA-2020 data products. In Section 4, we quantify 
fundamental-plane scaling relations (Tully & Fisher | the completeness of the SGA and review how it improves upon 
Djorgovski & Davis 1987) from SGA-2020 targets at =<0.15 existing large-galaxy catalogs, and in Section 5, we highlight 
(Saulder et al. 2023). And finally, the SGA-2020 will help some of the exciting potential scientific applications of the 
engage the broader public with visually striking color mosaics. SGA-2020. Finally, Section 6 summarizes the main results of 
of large, well-resolved, nearby galaxies, enabling a myriad of this paper and outlines some of the improvements we intend to 
educational and public-outreach activities include in the next version of the SGA. 

We organize the remainder of the paper in the following Note that, unless otherwise indicated, all magnitudes are on 
way: In Section 2, we define the SGA-2020 parent sample and the AB magnitude system (Oke & Schild 1970) and have not 
describe the procedure we use to define the final sample of been corrected for foreground Galactic extinction. We report all 


galaxies and their associated (angular) group membership. In fluxes in units of "nanomaggies," where 1 nanomaggie is the 
Section 3, we describe our photometric analysis, including how (linear) flux density of an object with an AB magnitude of 
we construct the custom multiwavelength mosaics, model the 
two-dimensional images of each galaxy, and measure their 
azimuthally averaged surface-brightness and color profiles. Of 
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2. Parent Sample and Group Catalog 
2.1. Building the Parent Sample 


Many of the largest, highest-surface-brightness galaxies in 
the sky have been famously known for a long time and are part 
of many of the legacy (photographic-plate) large-galaxy 
catalogs discussed in Section | (e.g, RC3). More recently, 
fainter, lower surface-briphtness (but still “large,” spatially 
resolved) galaxies have been cataloged by modern, wide-area 
optical and near-infrared imaging surveys like the SDSS, 
2MASS (Skrutskie et al. 2006), and Pan-STARRS! (Chambers 
et al. 2016). For the first version of the SGA, we opt to build 
upon this body of previous work by beginning from these and 
other catalogs of "known" large angular-diameter galaxies (but 
see the discussion in Section 6). 

Fortunately, several user-oriented databases exist, which 
curate the positions, sizes, magnitudes, redshifts, and other 
information on millions of extragalactic sources cataloged by 
different surveys, including SIMBAD (Wenger et al. 2000), the 
NASA Extragalactic Database (NED; Helou et al. 1991), and 
HyperLeda (Makarov et al. 2014). After some experimentation, 
we opt to construct the initial SGA-2020 parent sample using 
the HyperLeda'* extragalactic database. HyperLeda includes 
extensive metadata on nearly all known large angular-diameter 
galaxies, building on the heritage of the RC3 and earlier large- 
galaxy atlases. In addition, an effort has been made by the 
HyperLeda team to homogenize the angular diameters, 
magnitudes, and other observed properties of the galaxies that 
have been ingested into the database from a wide range of 
different surveys and catalogs (Paturel et al. 1997; Makarov 
et al. 2014). This procedure imposes some uniformity on our 
parent sample, although we show in Section 3.3 the significant 
value of computing the geometry (diameter, position angle, and 
ellipticity) and photometry of galaxies consistently and using 
modem (deep, wide-area) optical imaging 

With these ideas in mind, we query the HyperLeda 
extragalactic database for galaxies with angular diameter 
‘Dy(25) > 12" (0/2), where D(25) is the major-axis diameter 
of the galaxy at the 25magarcsec ^ surface-brightness 
isophote in the optical (typically the Johnson-Morgan B band; 
see Appendix A for additional details). Our query results in an 
initial parent sample of 1,436,176 galaxies, 

Using visual inspection and a variety of quantitative and 
qualitative tests, we cull this initial sample by applying the 
following additional cuts: first, we remove the Large and Small 
Magellanic Clouds and the Sagittarius Dwarf Galaxy from the 
sample by imposing a maximum angular diameter of 
DL (25) < 180. These galaxies span such a large projected 
angular size on the sky (many degrees) that their inclusion is 
outside the scope of the SGA (but see Jarrett et al. 2019). After 
removing these objects, the largest angular-diameter galaxies 
that remain are NGC0224 — M31 and NGC 0598 =M33 
whose Di (25) diameters are 178! and 62’, respectively. 

Furthermore, we limit the sample on the lower end to 
D,25)>20", which removes roughly 900,000 galaxies 
(approximately 65% of the initial sample). We implement this 
cut because we find that the fraction of sources with incorrect. 
(usually overestimated) diameters in HyperLeda incre 
rapidly below this limit. Moreover, we find that galaxies 
smaller than D; (25)= 20" are well modeled by Tractor as 
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part of the standard photometric pipeline used in LS/DR9 (Dey 
et al. 2019; D. J. Schlegel et al. 2023, in preparation). 

Next, we remove approximately 3800 galaxies with no 
magnitude estimate in HyperLeda (as selected by our query; 
see Appendix A), which we find to be largely spurious, as well 
as approximately 6500 objects with significantly overestimated 
diameters (or spurious sources), which we identify via visual 
inspection. Many of these cases are groupings of small galaxies 
or stars along a line, which have been misinterpreted by 
previous fitting algorithms as a single edge-on galaxy. In 
addition, we remove approximately 1700 galaxies whose 
primary galaxy identifier (in HyperLeda) is from either SDSS 
or 2MASS and whose central coordinates place it inside the. 
elliptical aperture of another (non-SDSS and non-2MASS) 
galaxy with Di (25) diameter greater than 30. We find that in 
the majority of cases these objects have grossly overestimated 
diameters, presumably due to shredding by the 2MASS and 
SDSS photometric pipelines. Figure 2 displays a gallery of 
some of the most common types of sources we reject from our 
initial parent sample using these cuts. 

Now, to improve the completeness of the parent sample, we 
supplement the initial HyperLeda catalog with sources draw 
from three additional catalogs, making sure to carefully handle 
duplicate entries. First, we add a subset of the Local Group 
dwarf galaxies from MeConnachie (2012), From the original 
sample of 93 galaxies in McConnachie (2012), we remove 47 
that have such a low surface brightness and are so well resolved 
that including them is beyond the scope of the current version 
of the SGA.” For reference, the median surface brightness of 
these 47 systems is jiy ^ 


28.8 mag arcsec ^, well below the 
surface-brightness completeness limit of the SGA-2020 (see 
Section 3.3). Furthermore, we remove the Fornax and Sculptor 
dwarf galaxies, which are higher surface brightness 
(uy =25.1-25.5 mag arcsec 7) but very well resolved into 
stars and star clusters and too challenging to include in this 
initial version of the SGA. 

Next, we add 190 galaxies from the RC3 and OpenNGC'* 
catalogs which are missing from our initial HyperLeda sample. 
Surprisingly, many of these systems are large and have high 
average surface brightness; however, we suspect that an issue 
with our database query (see Appendix A) may have 
inadvertently excluded these sources from our initial catalog 
‘And finally, we use the Legacy Surveys Data Release 8 (DR8) 
photometric catalogs to identify 2890 additional large-diameter 
galaxies in the Legacy Surveys footprint. ^ Specifically, after 
applying a variety of catalog-level quality cuts and extensive 
visual inspection, we include in our parent sample all objects 
(not already in our sample) from DRS with half-light radii 
Fso > 14" based on their Tractor model fits 

Our final parent sample contains 531,677 galaxies approxi- 
mately limited to D, (25) > 20/ and spanning a wide range of 
magnitude and surface brightness. In Figure 3, we show the 
celestial distribution of this sample, and in Figure 4, we show 
the range of apparent b-band magnitude.^ and angular 


T For reference, we remove the following Local Group dwarfs: Andromeda I 
1, m, V, VIL 1X, X, XI, XIL XIII XIV. XV. XVI, XVII, XVII, XIX, XX. 
XX XXII, XXII, XXIV, XXV. XXVI, XXVII, and XXIX: Anil, Aquarius 
Bootes I and I; Canes Venadci I and ll Carina: Coma Berenice: Draco: 
Hercules: Leo IV, Vs and T; Pisces T: Sagittarius dSph; Segue Tand IN; Sextans 
I; Ursa Major I and Il; Ursa Minor; and Willman 1 

P hups://github.com/mattiaverga/OpenNGC 
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WGC003084 SDSSJ211630.38+00181; 
. 1 é 


 PáCi 11208. 


Figure 2. Gallery of common types of HyperLeda sources rejected while building the parent sample. In each panel, the horizontal white Par represents 30”. The top 
row shows one spurious object and three examples o pairs or triples o stars or compact galaxies, which are recorded in HyperLeda as one "large" galaxy (represented 
hy the red ellipse). The middle row shows four examples of how bright stars and galaxies can lead to significantly overestimated galaxy diameters in HyperLeda. For 


example, PGC 2500856 in the middle right panel is a blue star, which has been miscategorized by HyperLeda as a galaxy with a majoraxis diameter of 
ulaxy PGC 3087062, which appears in the final SGA- 
etre shreds or misidentified parts of a 
2020 while the red ellipse isthe incorrect source from HyperLeda, which we remove from the parent 
PGC 071502 and PGC 093084 are HII regions in ESO240-004 and NGC 1507, respectively, while SDSSJ211630.38 018 


presumably due to the proximity of the nearby brig 
Tour examples of sources in HyperLeda, which are in fat photon 
galaxy from the SGA- 


PGC 188224 with an incorrect angular diame 


diameter spanned by the parent sample. We discuss the 
completeness of the sample in Section 4. 


Projected Group Catalog 


Our approach for the SGA is to jointly analyze galaxies that 
are relatively close to one another in terms of their projected (or 
angular) separation in order to properly fit overlapping light 
profiles. We emphasize that we do not require galaxies to be 
physically associated, which would require knowledge of their 
redshifis or physical separation. We build a simple group 
catalog from the parent sample described in Section 2.1 using 
the spheregroup friends-of-friends algorithm." We use a 10^ 
linking length, taking care to ensure that galaxies assigned to 
the same group overlap within 2 times their circularized Dy (25) 
diameter. 


T ups: /pydlreadibedocs io /en latest 


x 
10 catalog (blue ellipse), Finally, the bottom row shows 
laxy. In each case, the blue ellipse shows the correct parent 

ile (see Section 2.1). For example, 
is a foreground star in the body of 


Using this procedure, we identify 14,930 projected galaxy 
groups with two members, 1585 groups with 3-5 members, 51 
with 6-10 members, and just four groups with more than 10 
members, including the center of the Coma Cluster, the Virgo 
Cluster, and A3558 (although A3558 is outside the LS/DR9 
imaging footprint; see Section 3.3). Notably, 496.255 objects 
or 93% of the parent sample are isolated according to the 
criteria used to build the group catalog, 

For each galaxy group, we compute several quantities that we 
refer to in subsequent sections of the paper (but see Appendix B 
for the complete data model). GROUP. NAME is a unique group 
name, which we construct from the name of the group's largest 
member (ranked by Di(25) and the suffix GROUP (e. 
NGC 4406 GROUP). For isolated galaxies, GROUP. NAME is 
just the name of its only member (i.e., without the. GROUP. 
suffix). In addition, we compute GROUP. RA and GROUP. DEC to 
be the Dy 25)-weighted R.A. and decl., respectively, of all the 
group members. Once again, for isolated systems, GROUP. RA 
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Galaxy Surface Density (deg?) 


Figure 3, Distribution of 531,677 galaxies in the SGA-2020 parent sample in an equal-area Mollweide projection in equatorial coordinates, binned into 3.4 deg? 
healpix pixels (Zonca et al. 2019). The dark gray curve represents the Galactic plane. Note the significant variations in galaxy surface density, which we attribute to 
surface-brightness incompleteness and heterogeneity in the aggregate HyperLeda catalog- 
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Figure 4. Isophotal diameter, D,(25), vs. brband magnitude (left) and marginalized distribution of D,(25) (right) for the SGA-2020 parent sample. The black 
contours, points, and histogram represent galaxies from HyperLeda, while the green points and histogram are galaxies from the supplemental catalogs we use 10 
increase the completeness of the sample (see Section 2.1). For reference, the contours enclose 50%, 75%, 95%, and 99.5% of the blue points. These figures show that, 
by construction, the SGA-2020 parent sample is largely limited to D, 25) > 20" (0/333) but with a tail of galaxies with diameters as small as 10" (=0/167), and it 


includes galaxies as bright as By vou = 8 


and GROUP_DEC are identical to the RA and DEC coordinates of 
that galaxy (see Appendix B). Finally, we record our estimate of 
the diameter of the group in the quantity GROUP. DIAMETER. 
For isolated galaxies, GROUP. DIAMETER equals Dy (25), but for 
groups, we compute GROUP. DIAMETER to be the maximum 
separation of all the pairs of group members plus their D, (25) 
diameter (in arcminutes). 


3. Photometric Analysis 
3.1. Imaging Data 


We build the SGA-2020 from the same optical and infrared 
imaging data used to produce the Legacy Surveys DR9 


(Dey et al. 2019; D. J. Schlegel etal. 2023, in preparation). 
Briefly, the, optical data consist of grz imaging over 
20,000 deg? from a number of different surveys. In the 
North Galactic Cap, we use data from the Beijing-Arizona Sky 
Survey (BASS; Zou et al. 2017), which provides 5000 deg? 
of gr imaging using the 90Prime Camera (Williams et al. 2004) 
‘on the Steward Observatory Bok 2.3 m telescope at KPNO; and 
data from the Mayall z-band Legacy Survey (MzLS), which 
provides z-band imaging over the same +5000 deg* footprint 
as BASS using the Mosaic-3 camera (Dey et al. 2016) at the 
KPNO Mayall 4 m telescope. 


TE hups://www legacysurvey.org/de9 
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In the South Galactic Cap (SGC) and in the North Galactic 
Cap up to a decl. of approximately +32°, we use grz imaging 
over 215,000 deg? from roughly 50 distinct (but uniformly 
processed) data sets obtained with the Dark Energy Camera 
(DECam; Flaugher et al. 2015) at the Cerro Tololo Inter- 
American Observatory 4m Blanco telescope. Note that the 
majority of this DECam imaging comes from the DECam 
Legacy Survey (DECALS; Dey et al. 2019) and the Dark 
Energy Survey (DES; Dark Energy Survey Collaboration et al 
2016; Abbott et al. 2021).^ 

We supplement the optical data with all-sky infrared imaging 
at 3.4-22 jum from WISE (Wright et al. 2010). Specifically, we 
use the 6 yr WISE plus NEOWISE-Reactivation (NEOWISE- 
R; Mainzer et al. 2014) image stacks in WI (3.4 jm) and W2 
(4.6 im) from Meisner et al. (2021), and the W3 (12 jum) and 
W4 Q2jm) unblurred image stacks from Lang (2014); 
collectively, we refer to these custom image stacks as the 
unWISE coadds." Note that in the SGA-2020 we produce 
WISE image coadds for each galaxy (or galaxy group) in the 
sample but do not measure the infrared surface-brightness 
profiles; however, we intend to deliver the infrared (and 
ultraviolet) surface-brightness profiles and integrated photo- 
metry in a future version of the SGA (see Section 6). 


3.2, Multiwavelength Mosaics and Surface-brightness Profiles 


At this point in the analysis, we have multiband optical and 
infrared imaging covering roughly half the sky (Section 3.1) 
and an input parent catalog of central coordinates and system 
diameters for more than half a million sources (Section 2.2). 
The next steps are to build custom multiwavelength mosai 
centered on each of these positions (Section 3.2.1); model all 
the sources in the field using The Tractor (Section 3.2.2); 
and measure elliptical aperture photometry and azimuthally 
averaged surface-brightness profiles (Section 3.2.3). 

Before proceeding, we briefly summarize the principal 
software products we use, as well as the relationship between 
them. First, given an astrometrically and photometrically 
calibrated image, inverse variance image, and knowledge of 
the point-spread function (PSF), The Tractor”! uses the PSF 
and a family of two-dimensional galaxy models to forward- 
model the observed pixel-level data (Lang et al. 2016; Dey 
et al. 2019; D. Lang et al. 2023, in preparation), One of the 
principal advantages of The Tractor is that it handles 
multiband, multi-CCD, variable-PSF imaging in a statistically 
rigorous way, which is especially important when dealing with 
the full range of optical and infrared (WISE) data from the 
Legacy Surveys. In order to handle the imaging data, the 
Legacy Surveys team has developed legacypip: 
photometric pipeline that wraps The Tractor as its fitting 
engine and conveniently handles many tasks related to this 
imaging data set (Dey et al, 2019). Finally, for the SGA project 
specifically, we have developed the SGA™ and legacyha- 
1os^ software products, which rely on some of the lower- 
level legacypipe functionality (with Message Passing 


See Dey et al. (2019) for a complete list of the DECam programs used. 
20 hp: /unwise me 

2" ps: /github com /dstndstn tractor 

23 hips; /gihub.com /legacysurvey/legacypipe 

3 On GitHub: hups://sithub.com/moustakas/SGA with version 01 
deposited to Zenodo (Moustakas et al. 2023). 


5 On GitHub: hups://zithub.com/moustakas/legacyhalos with version 1.4 
deposited to Zenodo (Moustakas & Lang 2023} 
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Interface parallelization) but also include code to carry out 
the nonparametric photometric analysis, which is one of the 
comerstone data products of the SGA-2020 (see Section 3.3). 


3.2.1. Mosaics 


Given the diameter of each system and its central coordinates 
(GROUP RA, GROUP DEC, and GROUP DIAMETER: see 
Section 2.2), we first determine if LS/DR9 imaging exists in 
all three grz bands over at least 90% of the area. If not, we 
remove that system from further analysis (including, unfortu- 
nately, NGC 0224 = M31, where we only have MzLS z-band 
imaging). Then, for the remaining objects, we generate g-, r=, 
and z-band mosaics with a size that depends on the group size: 
for groups with GROUP DIAMETER < 14’, we generate a 
mosaic of diameter 3 x GROUP. DIAMETER: for groups with 
14! < GROUP DIAMETER < 30’, we generate a mosaic of 
diameter 2 x GROUP. DIAMETER; and for NGC 0598 = M: 
whose GROUP. DIAMETER is >30', we use a mosaic diameter. 
of 1.4 x GROUP. DIAMETER. We also choose the input imaging 
according to the following criteria: we use the DECam imaging 
(from DECaLS and DES) for all of the SGC, and for the North 
Galactic Cap when GROUP. DEC < 327375; and the BASS plus 
MzLS imaging otherwise (see Section 4.1.3 of Myers 
et al. 2023). 

Our analysis begins with reduced and calibrated CCD-level 
90Prime, Mosaic-3, and DECam imaging. These reduced data 
are generated using the NOIRLab Community Pipeline (CP) 
dedicated to each instrument (e.g., Valdes et al. 2014), together 
with several custom data-reduction steps developed by the 
Legacy Surveys team, We refer the reader to Dey et al. (2019) 
and D. J. Schlegel et al. (2023, in preparation) for a detailed 
description of these data-reduction procedures. Briefly, we use 
Pan-STARRSI PSF photometry (Chambers et al. 2016; 
Finkbeiner et al. 2016) transformed to the natural filter system 
of each instrument for photometric calibration” and Gaia Data 
Release 2 (Gaia Collaboration et al. 2018) stellar positions for 
astrometry.” The average photometric precision for bright (but 
unsaturated) stars is better than +10 mmag in grz over the full 
footprint, and the astrometric precision is approximately 
40/030 for DECam and Mosaic-3 and +0712 mas for 
90Prime (Dey et al. 2019), 

Accurate large-galaxy photometry depends crucially on 
robust and unbiased background-subtraction. For the SGA- 
2020, we utilize the same background-subtracted images used 
for LS/DR9. As we discuss below and in Section 3.3, however, 
the sky-subtracted data do contain systematic errors, which we 
intend to mitigate in future versions of the SGA (see the 
discussion in Section 6). 

First, the CP carefully masks astrophysical sources and then 
subtracts the large-scale sky-pattern across the field of view 
from each exposure using a low-order spline model derived 
from robust statistics measured on the individual CCDs. We 
mote that the 90Prime, Mosaic-3, and DECam CCDs are 
approximately 30' x 30’, 17!5 x 17/5, and 9' x 18’, respec- 
tively, so the angular scale of this background model is much 
larger than all but the largest galaxies in the SGA-2020. Next, 
the CP subtracts the camera reflection pattern (or pupil ghost) 
from the DECam and Mosaic-3 data and the fringe pattern from 
the Mo: and and 90Prime r-band data, Telescope 


ugs. Jwwvclegacysur y org d description photometry 
25 bups: / /www-legacysurvey-org /dr9 /description/ #astrometey 
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reflections from very bright stars are not removed. Next, the CP 
aggressively subtracts the high-frequency pattern noise (caused 
by a drifting amplifier bias level) present in the Mosaic 
imaging. The pattern fitting was designed to preserve counts in 
smaller galaxies and stars but, unfortunately, has a significant 
effect on the low-surface-brightness outer envelopes of the 
galaxies in our sample. This pattern-noise subtraction affects all 
the MzLS imaging and causes galaxies to appear 100 green in 
the grz color mosaics (see Appendix C). For DECam, we also 
subtract a residual g-, r-, and z-band sky pattern and a z-band 
fringe pattern from the data using median-scaled templates 
derived from multiple exposures (in a given bandpass) within 
‘one or more nights." Finally, we remove the spatially varying 
sky-backeround on the smallest scales by dividing each CCD 
into 512 pixel boxes, computing the robust median, and using 
spline interpolation to build the final background map. During 
this step, we mask pixels that lie within the elliptical aperture of 
any galaxy in the SGA-2020 parent catalog (from Section 2.1), 
as well as Gaia stars and other sources detected in each image 

Finally, with all the reduced data in-hand, we build the full- 
field mosaic for each galaxy (or galaxy group) as the inverse- 
variance weighted sum of all the available imaging (in each 
bandpass) projected onto a tangent plane using Lanczos 
(sinc) resampling. For the grz imaging, we adopt a constant 
pixel scale of 07262 pixel‘, and for the unWISE mosaics, we 
use 2775 pixel” !. The left panels of Figures 5 and 6 show, as 
examples, the gre color mosaics for the isolated galaxy 
NGC 5016 and PGC 193192, a member of the PGC 193199 
Group, respectively. 


322. Tractor Modeling and Masking 


We use The Tractor to model all the sources in a given 
mosaic, including the large angular-diameter galaxies of 
interest, Note that all source detection and model fitting with 
The Tractor takes place on these coadded images (triggered 
by invoking the --£it-on-coadds option in legacy- 
pipe), unlike for the standard DR9 processing in which all 
model fitting is done using the unresampled CCD images 
jointly (see Dey et al. 2019; D. J. Schlegel etal. 2023, in 
preparation), 

One issue is that The Tractor tends to fit the high-surfac 
brightness central regions of large galaxies at the expense of 
their outer envelopes, creating undesirable systematic residuals 
in their outer parts. This problem occurs despite the larger 

ariance—smaller inverse variance—in the central parts from 
source Poisson noise. Therefore, before fitting, we multiply the. 
optical inverse variance mosaics (1) by a factor of J7/I5o. 
where Zso is the median inverse variance of the whole mosaic. 
This scaling tends to reduce / in the central part of the galaxy 
and increase it in its outer regions, generally leading to better 
modeling results. 

In addition, we increase the threshold for detecting and 
deblending sources by specifying --saddle-fraction 0.2 
and --saddle-min 4.0 (the default values are 0.1 and 2.0, 
respectively). The saddLe-fraction parameter controls the 
fractional peak height for identifying new sources around 
existing sources, and saddle-min is the minimum required 
saddle-point depth (in units of the standard deviation of pixel 
Values above the noise) from existing sources down to new 
sources. We find these options necessary in order to prevent 


77 ps; /www degacysunvey o 
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excessive shredding and overfiting of the resolved galactic 
structure in individual galaxies (e.g., H II regions). Finally, The 
Tractor detects sources, creates a segmentation map, and then. 
uses the mean PSF of the coadd to compute the two- 
dimensional, maximum-likelihood model of each source (fitting 
all three grz bands simultaneously) from among the following 
possibilities: PSF, REX, EXP, DEV, or SER. For reference, we 
construct the PSF of the coadd as the inverse-variance weighted 
average PSF of the individual pixelized PSFs contributing to 
the coadd, which is sufficient given that the galaxies we are 
interested in, D, 5) = 20", are significantly larger than the 
optical image quality, PSFpwnm = 1-27. 

The middle left panels of Figures 5 and 6 show The 
Tractor model image stack for NGCS016 and the 
PGC 193199 Group, respectively. Overall, the model is an 
excellent description of the data, particularly for the small, 
compact sources in the field. However, note how The 
Tractor fits the (resolved) spiral arms in NGC 5016 as 
elongated blue “galaxies,” and the extended outer envelopes of 
the early-type galaxy models in the PGC 193199 Group 
compared to the data. Despite these issues, The Tractor 
models of the large galaxies in our sample are extremely useful 
and complementary to the nonparametric photometric measure- 
ments we carry out in Section 3.2.3 

Using The Tractor models, we next build an image mask, 
which we use in Section 3.2.3, First, we read the maskbi ts?” 
bit-mask image produced as part of the pipeline, but only retain 
the BRIGHT, MEDIUM, CLUSTER, ALLMASK G, ALL- 
MASK R, and ALLMASK 2 bits. Hereafter, we refer to this 
mask as the starmask. Next, we build a residual mask, 
which accounts for statistically significant differences between. 
the data and the Tractor models. In detail, we flag all pixels 
that deviate by more than So Gn any bandpass) from the 
absolute value of the Gaussian-smoothed residual image, which 
we construct by subtracting the model image from the data and 
smoothing with a 2 pixel Gaussian kernel. This step obviously 
masks all sources, including the large galaxies of interest, but 
We restore those pixels in the next step. In addition, we 
iteratively dilate the mask two times and mask the pixels along 
the border of the mosaic with a border equal to 2% the size of 
the mosaic. 

Then, we iterate on each galaxy in the group from brightest 
to faintest based on The Tractor r-band flux and carry out 
the following steps: (1) For each galaxy, we construct the 
model image from all Tractor sources in the field except the 
galaxy of interest and subtract this model image from the data. 
(2) We measure the mean elliptical geometry of the galaxy 
(center, ellipticity, position angle, and approximate semimajor 
axis length) based on the second moment of the light 
distribution (hereafter, the ellipse moments) using a 
modified version of Michele Cappellari’s mge.find ga- 
laxy" algorithm (Cappellari 2002). When computing the 
ellipse moments, we first median-filter the image with a 
3 pixel boxcar to smooth out any small-scale galactic structure, 
and we only use pixels with ji, < 27 mag arcsec *, (3) Finally, 
we combine the residual mask with the starmask (using 


75 Briefly, REX isa round (c = O) exponential galaxy model with variable half- 
light radius; EXP and DEV represent an exponential and de Vaucouleurs (1948) 
galaxy profile, respectively: and SER is a Sersic (1968) galaxy model (see Dey 
etal. 2019 and the LS/DR9 documentation for more details). 

29 ups: //www legacysurvey.org /dr9 /bitmasks/#maskbits 

30 big: /vwwastro physics ox c.i /-mxc/software/ imge 
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or one example galaxy, NGC 5016. The three panels on the left band side show 
of the corresponding Tractor model image; and (bottom left) the r-band image with 
masked pixels zeroed out (white pixel values). The nested black ellipses in this panel correspond to nine surface-brig 
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Boolean logic), but we unmask pixels belonging to the galaxy 
based on the ellipse moments geometry using 1.5 times 
the estimated semimajor axis of the galaxy. 

Occasionally, the preceding algorithm fails in fields contain- 
ing more than one galaxy if the central coordinates of one of 
the galaxies are masked by a previous (brighter) system. We 
consider a source to be impacted if any pixel in a 5 x 5 pixel 
box centered on The Tractor position of the galaxy is 
masked, In this case, we iteratively shrink the elliptical mask of 
any of the previous galaxies until the central position of the 
galaxy currently being analyzed is unmasked. We emphasize 
that this algorithm is not perfect, particularly in very crowded 
galactic fields like the center of the Coma Cluster, but we 
intend to improve it in future versions of the SGA. Another 


brightness 
Iriagles) measured within the same elliptical apertures shown in the lower le panel. The dash 


tively, and the bottom ri 
(independently fit to the photometric data in each bandpass) given by Equation (1). The legend in the lower 
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ii panel also shows the best-ftting curve-of-growth models 

! panel also provides the integrated (asymptotic) grz 


occasional failure mode is if the flux-weighted position of the 
galaxy based on the ellipse moments differs by The 
Tractor position by more than 10 pixels, which can happen 
in crowded fields and near bright stars and unmasked image 
artifacts; in this case, we revert to using The Tractor 
coordinates and model geometry. 

The bottom left panels of Figures 5 and 6 show the final 
masked r-band image for NGC5016 and PGC 193192, 
respectively. 


4.2.3. Surface-brightness Profiles 


With the multiwavelength mosaics and per-galaxy image 
masks in-hand, we next measure the surface-brightness profiles 
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Figure 6. Like Figure 5, but for PGC 193192, the second-largest member of the PGC 193199 Group. 


and photometric curves of growth for each galaxy in the sample. 
using the standard ellipse-fiting and aperture photometry 
techniques in photutils*! (Bradley 2023). We assume a 
fixed elliptical geometry as a function of semimajor axis using 
the ellip measured in Section 3.22, and 
robustly determine the surface brightness along each elliptical 
path from the light-weighted central pixel to 2 times the 
estimated semimajor axis of the galaxy in a 1 pixel (0/262) 
interval. In detail, we measure the surface brightness (and the 
uncertainty) using two c clipping iterations, a 3c clipping 
threshold, and median-area integration." 

From the r-band surface-brightness profile, we also robustly 
measure the size of the galaxy at nine equally spaced surface- 
brightness thresholds between ji, = 22 and 26 mag arcsec ^. We 
perform these measurements by fitting a linear model to the 
surface-brightness profile converted to mag arcsec ^ versus r^ 


moments 


ST pups /photutils readihedocsio/en/stable 
32 In other words, we use nclipo2, = 
dian, as documented in the phi 
image method. 


and integrmod. 
lips 


10 


(which would be a straight line for a de Vaucouleurs galaxy 
Profile, but only consider measurements that are within 
+1 mag arcsec? of the desired surface-brightness threshold 
To estimate the uncertainty in the resulting radius, we generate 
30 Monte Carlo realizations of the surface-brightness profile and 
use the standard deviation of the resulting distribution of radii 

We also measure the curve of growth in each bandpass using 
the tools in photutils aperture. Briefly, we integrate 
the image and variance image in each bandpass using elliptical 
apertures from the center of the galaxy to 2 times its estimated 
semimajor axis (based on the ellipse moments), again 
with a 1 pixel (07262) interval. We fit the resulting curve of 
growth, m(r), using the following empirical model 


mi) = ma + molog, | + x37] 


* Unfortunately, our original elliptical aperture photometry had an 
irrecoverable bug, so in the final release of the SGA-2020, we infer the 
aperture photometry from the surface-brightness profiles: see Appendix C for 
details. 
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Figure 7. Distribution of 383,620 galaxies in the final SGA-2020 sample (to be compared with Figure 3, but note the different color-bar scales). 


where Mor, mo, à, 2, and ro are constant parameters of the 
model, and r is the semimajor axis in arcseconds. In our 
analysis, we take the radius scale factor r= 10" to be fixed 
(which makes o, dimensionless). Note that, in the limit 
r— 56, my, is the total, integrated. magnitude. Using this 
model, we infer the half-light semimajor axis length, rsp, 
analytically from the best-fitting model parameters: 


-1yo 
L fa fa f ono) " 
ra= of [x 04m; 1 . (2) 


where rs, is measured in arcseconds, and my is in magnitudes. 


3.3. Summary and Validation of SGA-2020 Data Products 


The final SGA-2020 sample consists of 383,620 galaxies in 
the 220,000 deg? LS/DRO imaging footprint. The final catalog 
contains precise coordinates; multiwavelength mosaics; model 
images and photometry from The Tractor; azimuthally 
averaged optical surface-brightness profiles; aperture photo- 
metry and radii; and extensive metadata for all galaxies in this 
sample. In this section, we briefly highlight some of these 
measurements; for a comprehensive description of the SGA- 
2020 data products and how they can be accessed, see 
Appendix B. 

Figure 7 shows the celestial positions of the galaxies in the 
SGA-2020 in an equal-area Mollweide projection. The curved 
black line represents the Galactic plane, which divides the 
sample into the North Galactic Cap and SGC imaging regions 
of the LS/DRO footprint (see Dey et al. 2019), The total area 
subtended by the sample is 19,721 deg’, covering nearly 50% 
of the sky. 

In Figure 8, we highlight a handful of the measured SGA- 
2020 properties, as well as the relationships between them. In a 
multivariate corner plot (Foreman-Mackey 2016), we show rg. 
(e. the r-band magnitude within RQ6), where R26) is 
the semimajor axis at the 26magarcsec ^ isophote; 
(i, ga) = rras + 2.5 logy [7R°(26)]}, the average r-band 
surface brightness within R(26); ( is the galaxy position angle 


u 


(measured counterclockwise from north to east); and 
1— b/a, the galaxy ellipticity, where b/a is the minor-to- 
major axis ratio. Note the expected strong correlation between 
R(26) and rros, and the anticorrelation between c and (ji). 
5». Which is due to the light in more edge-on galaxies being 
attenuated more by the larger column of intemal dust 
attenuation (famously known as the Holmberg “transparency 
test”; Holmberg 1958; Giovanelli et al. 1994). 

In Figure 9, we compare some of the new geometrical 
measurements in the SGA-2020 against the measurements 
collated in HyperLeda. In panel (a), we plot Adecl. versus 
AR.A., the difference in equatorial coordinates. The overall 
agreement in positions is very good; the median, mean, and 
Ela scatter are 073, 074, and +0”5, respectively, although the 
coordinates of individual (especially irregular and low-surface- 
brightness) galaxies differ by up to tens of arcseconds. In 
Figure 10, we show a randomly selected set of 20 galaxies 
where the central coordinates in the SGA-2020 and HyperLeda 
differ by more than 3. Although reasonable algorithms may 
disagree about the central positions of some galaxies due to 
dust lanes or the lack of a prominant central bulge (e.g. 
NGC 4948 = IC 4156 and ESO293-034), it is clear that the 
coordinates in the SGA-2020 for most of the examples 
highlighted in Figure 10 are more accurate in measuring the 
centroid of galaxies with bright cores and the center of light for 
is, even for very large, well-known galaxies 
UGC 07878, and NGC 3521 = UGC 06150. 

Next, in Figure 9(b), we plot Di 25) (defined in Section 2.1) 
versus D(25), the SGA-2020 major-axis diameter measured at 
the 25 magarcsec ^ isophote. Not unexpectedly, we find a 
strong correlation between the two quantities, although DOS) 
is 20% larger, on average, than D, (25), presumably due to 
the deeper optical imaging used in the SGA-2020. Finally, in 
Figures 9(c) and (d), we plot the correlation between position 
angle, ó, and ellipticity, €, respectively, between HyperLeda 
and the SGA-2020. For more than 95% of the sample, the 
agreement between the two ó and e measurements are 
excellent 
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Figure 8. Multivariate distribution of a subset of measured SGA-2020 galaxy properties. From left to right along the bottom panels, we show ri 


(degree) 


y. the r-band 
isophote; (jj ac) the mean surface brightness within R(26); o, the galaxy 


position angle; and r, the galaxy ellipticity. The contours enclose 50%, 75%, 95%, and 99.5% of the blue points and have been smoothed by a 0.8 pixel Gaussian 


kemel, 


In order to validate our measurements further, in Figure 11, 
we compare the SGA-2020 coordinates and mean geometrical 
measurements against the WISE Extended Source Catalog of 
the 100 Largest Galaxies (hereafter, WXSC-100; Jarrett et al. 
2019). Of the 104 galaxies in the WXSC-100 sample, 59 are 
in the SGA-2020. Note that WISE imaging is less affected by 
dust extinction and more sensitive to the underlying spatial 
distribution of the older stellar population compared to our 
optical imaging, so we do expect some differences in these 
measurements. 

Focusing on Figure ll(a) first, we find good overall 
agreement in the central coordinates: the median, mean, and 
+ lo scatter in the coordinate differences are 0"76, 23, and 
22/4, respectively, notably smaller than the ~6" FWHM 
WISE WI PSF (Wright et al. 2010). However, there are some 
notable outliers, the three largest of which (NGC 0247, 


V aps; /vislab dia e-7a/research-wase 
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NGC 2403, and NGC 4395) have been annotated in the 
figure. Examining the optical and infrared mosaics of these 
and other sources in this comparison sample, we find that 
the differences are due to a combination of a lack of a 
distinct bright center and, for a handful of cases, errors in the 
SGA-2020. For example, both NGC 2403 and NGC 0247 
are late-type spirals (Hubble type 6 or SABe; see de 
Vaucouleurs et al, 1991) without a clear, bright nucleus, so 
the differences are arguably defensible. On the other hand, for 
NGC4395 and a handful of other objects, the SGA-2020 
central coordinates are likely incorrect (by up to ~7") relative 
to those published in the WSXC-100 (see Section 6 for 
additional discussion). 

Turing next to Figure 11(b), we compare D(26) in the 
SGA-2020 to 2 x yi in the WXSC-100; 2 times the radius 
of the galaxy measured down to an isophotal level of 
fiwi an 25.7 magarcsec 7. We find the two sizes to be 
reasonably well correlated, despite the differences in imaging 
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Figure 9. Comparison of select observed properties reported in HypeLeda against the newly measured quantities in the SGA-2020. (a) Difference in central 
coordinates: (b) Dy(25) vs. DOS, the major-axis diameter measured at the 25 mag arcsec * isophote: (c) galaxy poston angle, ó: and (d) galaxy ellipticity. 


ba, where bja is the minor-to-major axis ratio. In every panel, the contours enclose 50%, 75%, 95%, and 99.54% of the blue points and have been smoothed 


bby a 0.8 pixel Gaussian Kernel. In panels (b). (c), and (d), the solid red line represents the one-to-one relation. 


effective wavelength, albeit divided into two broad sequences. 
The sizes of the spirals (Hubble types 2 to 3—Sab to Sb) and 
later generally follow the one-to-one relation (median 
difference of —0.004 dex) with a scatter of 0.11 dex (30%), 
while the earlier-type, spheroidal galaxies (Hubbles types —5 to 
1—E to Sa) are offset to larger sizes in the WXSC-100 by 
—025 dex (27096) with a scatter of 0.16 dex (50%). As 
discussed by Jarrett et al. (2019), the WISE W1 band is very 
sensitive to light from evolved stars down to low surface- 
brightness levels (particularly due to its large pixels, 2/75), so 
it is not surprising for the WSXC-100 (infrared) diameters to be 
notably larger than the (optical, band) D(26) diameters in the 
SGA-2020. Moreover, the extended, low-surface-brightness 
envelopes of massive spheroidal galaxies are especially prone 
to oversubtraction (e.g. Blanton et al. 2011; Bernardi et al. 
2013), which may also be contributing to the systematically 
smaller sizes of the early-type galaxies in the SGA-2020 
relative to the WSXC-100 (see also Section 6). 
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Finally, in Figures 11(c) and (d), we compare the position 
angles and ellipticities reported in the SGA-2020 and WXSC- 
100 catalogs, respectively, and find very good agreement: the 
mean differences are ósc — Ówxsc — 083-727, and 
tsaa — ewxsc = 0.017 + 0.12. 


4. Catalog Completeness 


Quantifying the completeness of the SGA-2020 is difficult 
because the parent sample is largely defined by HyperLeda (see 
Section 2.1), which aggregates data from many different 
surveys—each with their own potentially complicated selection 
function—from the ultraviolet to the radio. This heterogeneity 
can be clearly seen in Figure 7 as variations in the sample 
surface density on both large and small angular scales and 
between the North Galactic Cap and SGC. Indeed, as we 
discuss in Section 6, one of the primary goals of the next 
version of the SGA is to redefine the parent sample using the 
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Figure 10. Randomly selected gallery of 20 galaxies where the li 


Legacy Surveys imaging itself, in order to begin with a more 
uniform and quantifiable selection function over the full area. 
Nevertheless, we can still assess the SGA’s completeness by 
comparing against existing catalogs that include large angular- 
diameter galaxies. 

We choose to characterize the completeness of the SGA- 
2020 by comparing against HECATE (version 1.1; Kovlakas 
et al, 2021).” HECATE is an all-sky value-added catalog of 
2:200,000 galaxies at z < 0.047 ($200 Mpc), which contains a 
breadth of both observed and derived (physical) galaxy 
properties. Although HECATE also uses HyperLeda to define 
its parent sample, the comparison is still valuable because the 
analysis carried out by Kovlakas et al. (2021) is independent of 
ours, and it includes additional completeness checks against 
NED and against the local B-band luminosity function. 

First, from the parent HECATE sample of 204,733 galaxies, 
we identify 154,093 objects (75%) to be within the LS/DR9 
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-weighted central coordinates measured in the SGA-2020 differ by more than 3" from the 
‘coordinates published in HyperLeda, sorted by increasing D (25) from the upper left to the lower right, The white bar in the lower left comer of 

30", and the blue and red cross-haired ellipses represent the SGA-2020 and HyperLeda positions and mean geometry, respectively. Although the c 
these systems are somewhat ambiguous due to dust lanes and other irregular features (e.g. NGC 4607), the S 


ch panel represents 
ers of some of 


3A-2020 coordinates are generally superior. 


imaging footprint using the LS/DR9 random catalogs, which 
contain a wealth of information about the imaging data 
(bandpass coverage, depth, PSF size, etc.) at random positions 


over the footprint (Myers et al. 2023)."^ Specifically, we merge 
together five random catalogs to achieve an effective source 
density of 12,500deg 2, and conservatively retain all 


HECATE objects whose center lies within 2’ of one of these 
points. Next, we remove 2844 without any size 
information in HECATE; visually inspecting a random subset 
of these reveals that they are predominantly small objects, D 
(25) « 30", well under the SGA-2020 angular diameter limit 
(Figure 4). Finally, from the remaining 151,249 objects, we 
match 95,800 (63%) of them to an object in the SGA-2020 
using the HyperLeda PGC number (Paturel et al. 1989 
Makarov et al. 2014), which both HECATE and the SGA- 
2020 record (where it is defined), We choose to use the PGC 
designation because the differences in coordinates can be 
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Figure 11. Comparison of the (a) central coordinates: (b) diameters: (c) position angles, o: and (d) ellipticities, c measured in the SGA-2020 against those reported in. 


the WXSC-100 (Jarrett et al. 2019) for an overlapping sample of 59 galaxies. Individual galaxies are color coded by numerical Hubble type, from 


5 (E) 10 9 (lir) (see 


de Vaucouleurs et al. 1991), as indicated by the color bar. See the text in Section 3.3 for the definition of the diameter measured in the WXSC-100 catalog and a 


detailed discussion of the observed trends. 


significant, such that a single matching radius results in a 
nonnegligible number of false-positives. For example, among 
the PGC-matched samples, the mean difference in coordinates 
is 0750-4092, but with a tail that extends out to 80" for 
1C 2574, a well-known 18! 7-diameter late-type (SABm) galaxy 
with no well-defined center. In fact, 126 PGC-matched galaxies 
have coordinate differences larger than 10”, 90% of which have 
D(26) > 1', Nevertheless, with these caveats in mind, we can 
still match an additional 178 objects using a 3” matching 
radius, resulting in a final matched sample of 95,978 galaxies. 

In Figure 12, we plot b-band magnitude versus 2 x R, for 
the 154,093 HECATE galaxies, which overlap the LS/DR9 
footprint, where R, is the semimajor axis length reported in 
HECATE. The dashed red histogram (corresponding to the 
right-hand axis) shows the raw fraction of matching HECATE- 
SGA galaxies as a function of galaxy size, in uniform 0.2 dex 
wide bins of angular diameter between ~2" and «158. The 
small numbers written above each bin of the histogram report 
the number of HECATE galaxies in that bin. In addition, the 
horizontal gray line indicates, for reference, a matching fraction 
of 100%. 

Taken at face value, the (raw) matching fractions shown in 
Figure 12 are surprisingly poor. For example, these results 
suggest that the SGA-2020 is missing 3%-5% of galaxies with 
angular diameters between 2/5 and 10’ and a whopping 20% of 
the galaxies between 10’ and 16’. In total, we find 2.94% (367/ 
12487) of the HECATE galaxies with 2 x R > 1' to be 
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missing from the SGA-2020. To explore this purported 
incompleteness, we visually inspect the LS/DR9 imaging at 
the position of the 367 “missing” galaxies and find the 
following results: 20 objects are Local Group dwarf galaxies, 
which are intentionally excluded from the SGA-2020 (see 
Section 2.1); 44 objects fall on the edge of the imaging 
footprint or other serendipitous (but unfortunate) gaps in three- 
band (grz) coverage; 64 galaxies are real, but the angular 
diameters reported in HECATE are overestimated, sometimes 
by a significant factor; 78 are part of a larger galaxy (e.g., H I 
regions) and other kinds of photometric shreds; and 24 are 
entirely spurious. The solid purple histogram in Figure 12 
shows the corrected fraction of HECATE-SGA matches after 
accounting for these errors. In the end, we find that just 137 out 
of 12,257 (1.12%) HECATE galaxies with 2 x R > 1 are real 
and genuinely missing from the SGA-2020. 

Although the fraction of missing galaxies is relatively low, 
since both the SGA-2020 and HECATE ultimately originate (in 
large part) from HyperLeda, it is surprising that any objects are 
missing from the SGA-2020, especially ones with angular 
diameters larger than 1, Although we do not know why these 
objects do not appear in our parent HyperLeda catalog, we 
suspect that an issue with the database query may be ultimately 
responsible (see Appendix A). In any case, we intend to ensure 
that these and other missing objects serendipitously identified 
by the SGA team through visual inspection are included in the 
next version, as we discuss in Section 6. 
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major axis length in HECATE. For reference, the contours enclose 10%, 25%, S0%, 75%, 95%, and 99% of the points. The 


ddashed-red and solid purple histograms (corresponding to the right-hand vertical axis) represent, respectively. the raw and corrected fraction of HECATE galaxies, 
Which match a source in the SGA-2020 galaxy as a function of angular diameter, Finally. the small numbers written above each bin of the histograms report the 


number of HECATE galaxies in that 0.2 dex wide bin. 


What about the completeness of the SGA-2020 among 
smaller  angulardiameter galaxies, 25" £2 x Ri < 1? 
‘According to Figure 12, the completeness remains relatively 
high, above 29066. However, our analysis of the >1’-diameter 
galaxies reveals that more than half of the HECATE objects 
missing from the SGA-2020 either have incorrect (or 
overestimated) diameters, or are spurious, and we have checked 
that these and other effects increase steeply with decreasing 
angular diameter (see, for example, Figure 2). Therefore, we 
conclude that the SGA-2020 completeness is likely 295% for 
galaxies with angular diameters between 2-25" and 1’. 

None of this discussion, of course, addresses the surface- 
brightness incompleteness of the sample, since both HECATE 
and the SGA-2020 inherit whatever incompletenesses and 
heterogeneities are present in HyperLeda, which aggregates 
data from many different surveys. For example, regions of the 
sky that have been imaged by the SDSS (Strauss et al. 200: 
Blanton et al. 2011) or DES (Dark Energy Survey 
Collaboration et al. 2016; Abbott et al. 2021) have uniform, 
deep optical imaging (4, sọ < 24.5 and ji, < 25.6 mag arcsec ^ 
in the SDSS and DES, respectively, where ji, so is the rband 
half-light surface brightness, and the DES surface brightness is 
measured in a 1795 diameter aperture), but these surveys cover 
just 34% (SDSS; 14,000 deg?) and 12% (DES; 5000 deg?) of 
the sky. Other optical and near-infrared surveys like 2MASS 
(arrett et al. 2000; Skrutskie et al. 2006) and Pan-STARRSI 
(PSI; Chambers et al. 2016) cover all or nearly all of the sky 
(100% and 75% for 2MASS and PSI, respectively): however, 
2MASS is relatively shallow compared to these other surveys 
(j, = 22.7 mag arcsec ^ assuming a median r — K, = 2.7 color 
for low-redshift galaxies; Jarrett et al. 2019) while the 
photometry of bright, large angular-diameter galaxies in PSI 
is known to be problematic (Magnier et al. 2020; Makarov 
et al, 2022), In other words, it is difficult to fully assess the 
incompleteness of the SGA-2020 given the variations in the 
completeness of the surveys that contribute to HyperLeda. 
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Nevertheless, we can still make some quantitative statements 
using the results shown in Figure 8 and the comparisons with 
HECATE, above. Based on the correlation between the mean 
surface brightness, (jege), and the apparent brightness 
within the 26 mag arcsec * isophote, rgo We conclude that 
the SGA-2020 is approximately 95% complete for galaxies 
with R(26) > 25", rroo $ 18, and (ji p20) £ 26 mag arcsec ^; 
in addition, the SGA-2020 is more than 99% complete for 
galaxies larger than 1’ and brighter than ra) $ 16 down to the 
same surface-brightness limit, 


5. Scientific Applications 


As discussed in the Introduction, atlases of large angular- 
diameter galaxies have played a pivotal role in observational 
cosmology and in our modern understanding of galaxy 
astrophysics and the galaxy-halo connection. By delivering a 
carefully constructed catalog of known "large" galaxies with 
new deep optical and infrared imaging from the DESI Legacy 
Imaging Surveys, we anticipate the SGA-2020 to play a 
commensurately high-impact role in a wide range of 
observational studies of large, nearby, well-resolved galaxies. 

The growing spectroscopic data set from DESI is an 
especially powerful complement to the SGA-2020. As 
discussed in Section 1.2, DESI is targeting the SGA-2020 
sample over 14,000 deg? (70% of the LS/DR9 footprint) as 
part of the flux-limited (r < 20.175) BGS (Hahn et al. 2023). 
In addition to providing precise spectroscopic redshifts, the 
spectral coverage (3600-9800 A), instrumental resolution 
(R ~ 2000-5000), and spectrophotometric precision (2%) 
of the DESI spectra (Abareshi et al. 2022; Guy et al. 2023) will. 
yield important insights into the physical conditions and stellar 


77 Tn detail, BGS is observing all galaxies brighter than r = 19.5 (BGS Bright) 
and a colorsclected subset of galaxies with 19.5 < 7 < 20.175 (BGS Faint): 
the color selection is tuned to ensure good redshift success at this fux level in 
right sky conditions. 
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Figure 13. Left: celestial distribution of galaxies inthe SGA-2020, which are also DESI targets (color bar) covering ~14,000 deg? (Myers et al. 2023) compared to the 


Tull 20,000 deg? footprint of the SGA-2020 (dark gray regi 
sedshifts fror 


in the SGC). Right: redshift distribution of galaxies in the SGA-2020 with existing spectroscopic 
HyperLeda (blue histogram) and from the DESI Early Data Release (EDR; orange histogram). The legend in this figure also indicates the approximate 


final number of SGA-2020 galaxies that DESI will observe by the end of the 5 year Main Survey 


populations of the central regions of these systems (for recent 
reviews, see Conroy 2013; Kewley et al. 2019; Sánchez 2020, 
and references therein). 

Figure 13 illustrates the tremendous scale of the DESI data 
set. Although the majority of galaxies in the SGA-2020 have 
previously measured redshifts in HyperLeda (264,865 galaxies, 
or approximately 70% of the sample), these redshifis come 
from a wide range of different surveys spanning many decades. 
By the end of its 5 year Main Survey (2021-2026), DESI will 
produce a homogeneous, high-precision spectrophotometric 
data set for more than 300,000 SGA-2020 targets over 
14,000 deg? as part of a larger sample of approximately 14 
million BGS targets and more than 25 million fainter 
extragalactic targets (DESI Collaboration et al. 2023b; Myers 
et al. 2023), Indeed, in its 2023 June Early Data Release 
(EDR), DESI has already delivered high-quality spectroscopy 
for nearly 7700 SGA-2020 targets, which were observed during 
the DESI Survey Validation period (DESI Collaboration et al 
20234). 

In addition to these Main Survey observations, one of the 
especially exciting and synergistic secondary DESI programs is 
the Peculiar Velocity survey (Saulder et al. 2023)." This 
program aims to use the fundamental plane and Tully-Fisher 
relations as direct distance indicators in order to map the 
peculiar-velocity field at z< 0.15; this map will be used to 
place new, stringent constraints on the cosmological parameters 
and the growth of large-scale structure (e.g., Strauss & Willick 
1995). As part of this effort, galaxies in the SGA-2020 are 
being targeted not only in their bright nucleus but also at 
various positions along their major axis and other “off-centei 
positions (Saulder et al, 2023; K. A. Douglass et al. 2023, in 
preparation), Together with the central spectra, these data will 
help constrain the total (dynamical) masses and physical 
conditions in a sample of tens of thousands of SGA-2020 
galaxies. 

‘The SGA-2020 also has the potential to support the growing 
number of time-domain and multimessenger astronomic: 
discoveries, wherein the observations of transient astrophysical 
events are detected by one or more messenger particles 


57 ups /Mata.desi.Ibl.gov /doc releases ede. 

39 Secondary programs are bespoke scientific programs, which utilize 
Whatever spare fibers may be available in a given DESI tle or pointing of 
75000 fibers (see Myers et al. 2023 for details). 
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(electromagnetic radiation, neutrinos, cosmic rays, and 
gravitational waves; e.g., Neronov 2021). In the case of 
gravitational wave events, for example, identifying the host 
galaxy and, ideally, the electromagnetic counterparts of these 
events is extremely challenging due to the sipnificant positional 
error ellipse of gravitational wave observations, 3100 deg? 
(Gehrels et al. 2016; Abbott et al. 2020). Statistically complete 
catalogs of large, nearby galaxies with accurate coordinates, 
size information, and multiband photometry like the SGA-2020 
are needed to help identify the most likely source of 
gravitational wave events (e.g., Gehrels et al. 2016; Ducoin 
et al. 2020; Kovlakas et al. 2021). 

Finally, we highlight one other area where the SGA-2020 is 
playing an important ancillarly role. The SGA-2020 is being 
used as a foreground angular mask for all the DESI dark-time 
cosmological tracers: luminous red galaxies (LRGs; Zhou et al. 
2023); emission-line galaxies (ELGs; Raichoor et al. 2023); 
and quasars (QSOs; Chaussidon et al. 2023). Like bright stars, 
large angular-diameter galaxies can bias the small-scale 
clustering signal in cosmological analyses because photometric 
pipelines tend to shred structurally resolved galaxies into many 
smaller sources. Similarly, the same angular mask can be used 
as an external input when building photometric catalogs from 
other imaging data; for example, an early version of the SGA- 
2020 was used to maximize the purity of the 3-5 jm unWISE 
photometric catalog of 2 billion infrared sources (Schlafly et al. 
2019). In other words, by providing a high-quality geometric 
mask, the SGA-2020 is helping these and other observational 
programs fulfill their scientific promise. 


6. Summary and Future Work 


We present the 2020 version of the SGA, SGA-2020, a 
multiwavelength optical and infrared imaging atlas of 383,620 
large angular-diameter galaxies covering +20,000 deg?" The 
SGA-2020 contains precise coordinates; optical (grz) and 
WISE/WI through W4 (3.4-22 jm) infrared mosaics; model 
images and photometry based on the state-of-the-art image 
modeling code, The Tractor; azimuthally averaged grz 
surface-brightness and color profiles; elliptical curves of 
‘growth and half-light radii; and extensive ancillary information 


T Excludi jest regions of dust extinction in the Galactic plane 
|b| >20°, the SGA-2020 covers 74% of the available =27,200 deg” of 
extragalactic sky, 
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for the full sample. The complete SGA: 
through the dedicated — web 
legacysurvey.org. 

Our measurements of the central coordinates, isophotal 
diameters, ellipticities, and position angles show overall good 
agreement with published measurements collated in the 
HyperLeda extragalactic database and in the WISE Extended 
Source Catalog of the 100 Largest Galaxies (WXSC-100; 
Jarrett et al. 2019). Disagreements in these quantities between 
the SGA-2020 and HyperLeda are generally due to erroneous 
measurements in HyperLeda; however, the comparison with 
WXSC-100 identifies small but notable (=few arcsecond) 
errors in the SGA-2020 coordinates for some of the largest 
C»5^) galaxies in the sample. 

We evaluate the completeness of the SGA-2020 by 
comparing against the HECATE all-sky, value-added galaxy 
catalog (Kovlakas et al. 2021). We find that the SGA-2020 is 
missing approximately 1% of known galaxies larger than 1’ and 
5% of galaxies between 25" and l^; however, we compute 
these statistics only after determining (through visual 
inspection) that nearly 30% of the sources larger than |' in 
HECATE that are missing from the SGA-2020 are either 
spurious or photometric shreds. Overall, we estimate that the 
SGA-2020 is more than 95% complete for galaxies larger tha 
R26) = 25" and r« 18 measured at the 26 magarcsec ^ 
isophote in the band. 

We discuss some of the potential scientific applications of 
the SGA-2020 and highlight the ongoing amassing of high- 
quality optical spectrophotometry from the DESI (DESI 
Collaboration et al. 2023b) survey. Approximately 70% of 
the SGA-2020 sample lies within the 14,000deg^ DESI 
footprint and is being targeted as part of the BGS (Hahn 
et al, 2023). To date, 7700 DESI spectra of SGA-2020 galaxies 
are publicly available as part of the DESI EDR (DESI 
Collaboration et al. 20234), representing less than 3% of the 
final sample. In addition, the DESI Peculiar Velocity secondary 
program (Saulder et al. 2023) is obtaining tens of thousands of 
major-axis, minor-axis, and other off-center spectra of SGA- 
2020 galaxies in order to constrain the peculiar-velocity field at 
z « 0.15, further enhancing the scientific impact of the SGA- 
2020. Finally, we highlight the potential impact of the SGA- 
2020 for identifying the electromagnetic counterparts of time- 
domain and multimessenger astronomical events. 

We conclude by discussing some of the planned improve- 
ments of the SGA. Future versions will focus on four broad 
areas, listed here in no particular order: 


portal, 


1. Completeness. As discussed in Section 4, the SGA-2020 
inherits the heterogeneity and incompleteness of the 
parent HyperLeda sample. To mitigate these issues, we 
intend to build a new parent sample of candidate large 
angular-diameter galaxies by detecting them from the 
Legacy Surveys imaging data themselves using a 
combination of traditional source-detection techniques 
(eg., convolution kernels optimized for detecting large 
galaxies) and one or more state-of-the-art deep-learning, 
techniques (e.g., Stein et al. 2022; Zaritsky et al. 2023). 
By inserting artificially generated galaxies of varying 
size, integrated flux, and surface brightness into the data, 
we intend to quantify the completeness limits of the 
sample as a function of these observational quantities. 

Centroiding and masking. Another area of improvement 
is how the SGA pipeline handles mergers and other 
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systems with two or more close companions (eg. in 
dense cluster environments like the Coma Cluster), as 
well as galaxies near bright stars, Evaluating and 
optimizing the performance of the pipeline in these 
circumstances will be part of a larger visual inspection 
effort (e.g, Walmsley et al. 2022) to ensure that the 
central coordinates of the largest galaxies in the SGA- 
2020 are accurate (or at least defensible, in the case of 
galaxies with irregular morphologies) compared with 
previous independent measurements (eg, see the 
discussion in Section 3.3). 

3. Background subtraction. As discussed in Section 3.2.1, 
aggressive subtraction of the Mosaic-3 pattern-noise 

nificantly impacts the z-band photometry of the SGA- 
2020 galaxies in the northem (BASS--MzLS) portion of 
the footprint, decl. 2 32°, including some of the most 
famous and largest angular-diameter galaxies in the 
sample like NGC 5194 = MSla and NGC 5457 = M101. 
Moreover, we show in Section 3.3 that the outer envelopes 
of the early-type, spheroidal galaxies in the atlas have 
likely been oversubtracted, thereby biasing their surface- 
brightness profiles and inferred isophotal diameters. We 
intend to address both these issues in a future version of 
the SGA (see, for example, Li et al. 2022 for a new sky- 
subtraction technique, which addresses the latter issue). 

4. Extensions in footprint and wavelength. Since the SGA- 
2020 was finalized, thousands of square degrees of 
additional DECam imaging have been acquired; we 
intend to use these data to extend the SGA footprint." In 
addition, in the next version of the SGA, we intend to (1) 
include DECam i-band imaging, which is available for 
more than 15,000deg” of area; (2) regenerate the 
unWISE coadds but also add ultraviolet (1528 and 
2271 A) coadds from the GALEX (Martin et al. 2005; 
Morrissey et al. 2005), where GALEX data are available; 
and (3) measure the surface-brightness profiles in all the 
available bandpasses from 0.15 to 22,m. These 
additional measurements will further increase the 
scientific impact and long-term legacy value of the SGA. 


Finally, future version of the SGA will also include ancillary 
spectroscopic redshifts and spectrophotometric measurements 
from DESI and, eventually, a wide range of physical properties 
(stellar masses, star formation rates, etc.) derived using state-of- 
the-art spectral energy distribution modeling (eg. Leja 
et al. 2017). 
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lolkam Du'ag (Kitt Peak) a mountain with particular 
significance to the Tohono O'odham Nation. 

NOIRLab is operated by the Association of Universities for 
Research in Astronomy (AURA) under a cooperative 
agreement with the National Science Foundation. 

This project used data obtained with the Dark Energy 
Camera (DECam), which was constructed by the Dark Energy 
Survey (DES) collaboration. Funding for the DES Projects h 
been provided by the U.S. Department of Energy, the U.S. 
National Science Foundation, the Ministry of Science and 
Education of Spain, the Science and Technology Facilities 
Council of the United Kingdom, the Higher Education Funding 
Council for England, the National Center for Supercomputing 
Applications at the University of Illinois at Urbana-Champaign, 
the Kavli Institute of Cosmological Physics at the University of 
Chicago, Center for Cosmology and Astro-Particle Physics at 
the Ohio State University, the Mitchell Institute for Funda- 
mental Physics and Astronomy at Texas A&M University, 
Financiadora de Estudos e Projetos, Fundacao Carlos Chag 
Filho de Amparo, Financiadora de Estudos e Projetos, 
Fundacao Carlos Chagas Filho de Amparo a Pesquisa do 
Estado do Rio de Janeiro, Conselho Nacional de Desenvolv 
mento Cientifico e Tecnologico and the Ministerio da Ciencia, 
Tecnologia e Inovacao, the Deutsche Forschungsgemeinschaft, 
and the Collaborating Institutions in the Dark Energy Survey. 
The Collaborating Institutions are Argonne National Labora- 
tory, the University of California at Santa Cruz, the University 
of Cambridge, Centro de Investigaciones Energeticas, Med- 
ioambientales y Tecnologicas-Madrid, the University of 
Chicago, University College London, the DES-Brazil Con- 
sortium, the University of Edinburgh, the Eidgenossische 
Technische Hochschule (ETH) Zurich, Fermi National 
Accelerator Laboratory, the University of Illinois at Urbana- 
Champaign, the Institut de Ciencies de l'Espai (IEEC/CSIC), 
the Institut de Fisica d'Altes Energies, Lawrence Berkeley 
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National Laboratory, the Ludwig Maximilians Universitat 
Munchen and the associated Excellence Cluster Universe, the 
University of Michigan, NSF's NOIRLab, the University of 
Nottingham, the Ohio State University, the University of 
Pennsylvania, the University of Portsmouth, SLAC National 
Accelerator Laboratory, Stanford University, the University of 
Sussex, and Texas A&M University. 

BASS is a key project of the Telescope Access Program 
(TAP), which has been funded by the National Astronomical 
Observatories of China, the Chinese Academy of Sciences 
(the Strategic Priority Research Program "The Emergence of 
Cosmological Structures" grant No. XDB09000000), and the 
Special Fund for Astronomy from the Ministry of Finance. 
The BASS is also supported by the External Cooperation 
Program of Chinese Academy of Sciences (grant No. 
1I4AIIKYSB20160057), and Chinese National Natural 
Science Foundation (grant No. 11433005). 

The Legacy Survey team makes use of data products from 
the Near-Earth Object Wide-field Infrared Survey Explorer 
(NEOWISE), which is a project of the Jet Propulsion 
Laboratory /California Institute of Technology. NEOWISE is 
funded by the National Aeronautics and Space Administration, 

The Legacy Surveys imaging of the DESI footprint is 
supported by the Director, Office of Science, Office of High 
Energy Physics of the U.S. Department of Energy under 
contract No. DE-ACO2-05CH1123, by the National Energy 
Research Scientific Computing Center, a Department of Energy 
Office of Science User Facility under the same contract; and by 
the U.S. National Science Foundation, Division of Astronom- 
ical Sciences under contract No. AST-0950945 to NOAO. 

Software: Ast ropy (Astropy Collaboration et al. 2013, 
2018, 2022), corner. py (Foreman-Mackey 2016), healpy 
(Zonca et al. 2019), NumPy (Harris et al. 2020), Matplot lib 
(Hunter 2007), Photutils (Bradley 2023), PyDL (Weaver 
et al 2019), scipy (Virtanen et al. 2020) seaborn 
(Waskom 2021). 


Appendix A 
HyperLeda Database Query 


In Section 2.1, we present our procedure for building the 
SGA-2020 parent sample, which begins with a HyperLeda 
database query. For the purposes of reproducibility, this 
appendix documents the exact query we execute on the 2018 
November 14 version of the HyperLeda database, which results 
in a catalog of 1,436,176 galaxies: 

WITH 

"R50" AS 

( 

SELECT pgc, avg (lax) AS lax, avg (sax) AS sax 

FROM rawdia 

WHERE quality-0 and dcode-5 and band between 

4400 and 4499 GROUP BY pgc 

n 

"IR" AS 

( 

SELECT pgc, avg (lax) AS lax, avg (sax) AS sax 

FROM rawdia 

WHERE quality-0 and iref in 

dcode-7 and band-0 GROUP BY pgc 

) 

SELECT 
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m.pgc, m.objname, 
de2000, m.type, 
m.bar, m.ring, 
m.t, m.logd25, 
m.lagr25, m.pa, 
modbest, 
"R50".lax, "R50".sax, "IR".lax, "IR".sax, 
FROM 

m000 AS m 

LEFT JOIN "R50" USING (pge) 

LEFT JOIN "IR" USING (pgc) 

WHERE 

obj typ: and (m.logd25>0.2 or 
lax»0.2 or "IR".1ax>0.2) 


m.objtype, m.al2000, m. 


m.multiple, m.compactness, 


m.bt, mit, mkt, mv, m 


"RSO". 


Appendix B. 
Data Products and Data Access 

In this appendix, we define the data model for the SGA-2020 
data products and describe how all the data can be accessed. All the. 
input imaging data used to construct the SGA-2020, including the 
6 year unWISE image stacks, are publicly accessible through 
https:/ /www.legacysurvey.org. The SGA-2020 data themselves 
can be retrieved through the SGA-2020 Data Portal at https: / /sea. 
legacysurvey.org. This portal provides a searchable database for 
retrieving data and visualizations for individual galaxies, as well as 
bulk-download access to the full data set. For example, the 
following link will navigate directly to the beautiful face-on spiral 
(Hubble type SABc) NGC 2532: https://sga legacysurvey.org/ 
group/NGC2532. In addition, the data can be accessed (and 
cross-referenced against a growing number of additional data sets, 
including DESI) through NSF's NOIRLab Astro Data Lab at 
https: //datalab.noirlab.edu/sea php. Finally, one can interactively 
explore the sample in the Legacy Surveys imaging footprint via the 
SGA layer of the Legacy Surveys Viewer; for example, the 
following link will navigate to the position of NGC 2532 in the 
North Galactic Cap: https: //legacysurvey.org,/viewer ira=122. 
5638&dec=33.9568&layer=ls-dr9&zoom=13&sga. 
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Table 1 
ScA-2020. fits File Contents 


LII Data Model 
Number Name —  Documentadon Description 
HDUoI  ELDLIPSE Table? Sample metadata and ellipse- 


fitting results. 
‘The Tractor fitting results” 


HDUB m 


CTOR — LS/DRO website” 


Notes. Catalog containing the principal measurements for all 383,620 galaxies 
in the SGA 2020 as a muliextension FITS file with two row-matched header 
and/or data units (DU. 
* s/w depacysurvey org [r9 /catalogs 

This table also includes SGA_TD (see Table 2) to facilitate erossmatching 


Most users will be interested in the SGA~2020. Fits file, a 
multiextension FITS catalog, which contains detailed informa 
tion for all 383,620 galaxies in the SGA-2020. Table 1 
summarizes the contents of this file, and Table 2 contains a 
detailed description of the ELLIPSE header and/or data unit 
(HDU). In addition, Table 3 defines the ELLIPSEBIT bit- 
masks, which record issues associated with the ellipse fitting 
and surface-brightness profile modeling, if any 

Furthermore, for each galaxy group in the atlas (ie., each 
row in the SGA-2020.fits catalog where GROUP. 
PRIMARY is True), we generate the set of files summarized 
in Table 4 (see also Section 3.2), which include the individual 
multiwavelength mosaics, Tractor catalogs, surface-bright- 
mess profiles, and other key data products. These files are 
organized into the directory structure RASLICE/GROUP_ 
NAME, where GROUP_NAME is the name of the galaxy group 
(see Section 2.2), and RASLICE, which ranges between 000 
and 359, is the 1° wide slice of the sky that the object belongs 
to. Specifically, in Python, 


RASLICE-": 


3d" , format (GROUP RA) 


Finally, Table 5 documents the data model of the ellipse-fitting 
and surface-brightness-profile results for each individual galaxy 
in the SGA-2020. 
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Table 2 
ELLIPSE HDU Data Model 


uiis Description 


sigue integer identifier. 
SGA galaxy name, constructed as "SGA-2020.56; 
Unique galaxy name. 

Unique identifier from the Principal Catalogue of Galaxies. 


ID" 


RALLEDA degree R.A. (12000) from the reference indicated in REE. 

DEC_LEDA desee Deel. 02000) from the reference indicated in 5 

MORPHIYPE Visual morphological type from HyperLeda (if available). 

PA_LEDA desee Galaxy position angle (measured east of north); taken from the reference indicated in REF. 

D25 LEDA areminute  Major-axis diameter at the 25 mag arcsec“? (optical) surface-brighiness isopbote; taken from the 
reference indicated in REF. 

BA_LEDA Ratio of the semiminor axis to the semimajor axis; taken from the reference indicated in REF, 

ziena" Heliocentic redshift from HyperLeda. 

5B_D25_LEDA Vegamuparsce? Mean surface brightness based on D25_LEDA amd MAG LEDA. 


MAG LEDA* Vega mag Apparent brband magnitude. 

BYHAND Boolean fag indicating whether one or more of FA. LEDA, DEC_LEDA, D2S_LEDA, PA_LEDA, 
BA_LEDA, or MAG_LEDA were changed from their published values (usually via visual 
inspection) while building the parent sample. 


REF Reference indicating the origin of the object (see Section 2): LEDA-20181114, LGDWARFS, RC3, 
enNGC, or DAB, 

GROUP 1D Unique group identification number. 

GROUP NAME Group name, constructed from the name of its largest member (see Section 2.2). For isolated galaxies, 
identical to GALAXY. 

GROUP MULT Number of group members (ie.. group multiplicity). 

GROUP_PRIMARY Boolean fag indicating the primary (ie. largest) group member. 

GROUP_RA degree Mean R.A. of the group weighted by D25_LEDA. 


GROUP DEC degree Mean decl. of the group weighted by D2*. LEDA 
GROUP_DIAME! arcminute Approximate group diameter (see Section 2.2) 


BRICKNAME Name of custom Tractor “brick; encoding the sky position, e.g., *1126p222” is centered on R. 
A. = 11256, decl. = 122. 

RA degree R.A. (2000) based on The Tractor model fit. 

E degree Decl. (12000) based on The Tractor model fi 

D26 arcminute Major axis diameter at the ji = 26 mag arcsec“? r-band isophote. 

26_REEY Reference indicating the origin of the DIAM measurement: S826, 3825, or LEDA. 

PA degree Galaxy position angle (measured east of north), as measured from the e11ipse moments (see 
Section 32.3) (or equivalent to PA, LED if the ellipse moments could not be measured), 

BA Minoro-major axis ratio, as measured from the e11ipse moments (or equivalent 10 BA_LEDA ifthe 


ellipse moments could not be measured). 
RA MOMENT degree Light-weighted RA. (12000), as measured from the &11:pse moments. Equivalent to 7A. X0 in 
"Table 5 but set to RA_LEDA if ellipse fiting failed or was not carried out 


DEC_MOMENT degree Like RA MOMENT but forthe decl. axis 
SMA MOMENT“ arcsecond Second moment of the light distribution along the major axis based on the measured e1lipse. 
moments. Equivalent to MAJORAXIS in Table 5 but converted to arcseconds. 

arcsecond Half-light semimgjor axis length based on Equation (2) 
SMA_SB(sblev arcsecond Semimajor axis length at the r-band (sbevel)magamsee " isophote. 

AB mag Cumulative brightness measured within SA, SB(sblevel). 

AB mag 10 uncertainty in (g72) MAG SB[sblevel) 

AB mag Bestiting curve-of-growth parameter m from Equation (1). 


Bestiting curve-of-growth parameter my from Equation (1). 
Bestfiting curve-of-growth parameter o, from Equation (1). 
Bestfiting curve-of-growth parameter o from Equation (1). 
(ar2). coc. i Y 1 of the fit to the curve of growth (see Section 3.2.3). 
ELLIPSEBIT See Table 3. 


Notes, ELLTPSE HDU of the SGA-2020. £i ts merged catalog defined in Table 1. In this table, (grz) denotes the gr 
oF W4 bandpass; and (sblevel) represents the 22, 22.5, 23, 23.5, 24, 24.5, 25, 25.5, and 26 mag aresee-* isophote. 
values are represented with a —1. For some quantities (eg. PGC or 2_LEDA),a missing value does not necessarily mean that that value does not exist 
This magnitude estimate is heterogeneous in both bandpass and aperture but, for most galaxies, itis measured in the B band: use with care. 

7 By default, we infer 026 from SMA 5826. However, if the r-band surface-hrightness profile could not be measured at this level, we estimate 026 as 
25 x SMA_SB25 or 15 x D25_LEDA, in that onder of priority. 

# W ellipseciting or eurve-of-growth modeling filed or was not attempted, then these columns’ values are -1 


mand filer; (wis) denotes the Wl, 
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Table 3 
Ellipse-iting Bitmasks 


Bit Number Tit Name Deliniion 


T Not used ignore. 
Ellipse fit skipped: galaxy classified as too-small type REX 
Ellipse fit skipped; galaxy classified as too-small type EXP, 
Ellipse fining was attempted but filed, 

NOTFIT Ellipse ting was not attempted. 

REJECTED Ellipse-fitng results were rejected based on visual inspection, 


Note. Bitmask encoding various reasons why ellipse fitting filed or was not attempted; see Section 3.2.3 for additional details. 


Table 4 
Images and Catalogs 

Filename Description 

GROUP_NAME-ceds-(region) fits ‘CCDs contributing to the optical image stacks 


GROUP_NAME-largegalaxy-blobs.fis 


Enumerated segmentation (“blob”) image. 


> NAME-largegalaxy ractor.fts ictor catalog of all detected sources in the field. 
GROUP_NAME-Jargegalaxy-maskbits. fs. Image encoding the LS/DR9 bitmasks* contributing to each pixel. 
GROUP_NAME-argeyalaxy-outlier-mask fs. Image of pixels rejected during outlier masking 
GROUP WAME-deplh- ar) fit Image of the Sz point-source depth at each pixel 

LNAME-largegalaxy-pst- oz) its. Postage stamp of the inverse-variance weighted mean pixelized PSF at the center of the field. 


GROUP. NAME-argegalaxy-imus 
GROUP_NAME-largegalaxy-invvar- (gz z)-fils.fz 
GROUP_NAME-largegalaxy-model-(gr z) fits.f2 


-— Inverse-variance weighted optical image. 
e variance image stack, 
etor model image coadd. 


P NAME-argegulaxy-image-arz jp Color (grz) JPG image of the image stack. 
P _NAME-argegalaxy-model-gr2 jpg Color (grz) IPG image of The Tractor model image. 
P NAME-largegalaxy-resi-grz jpg Color (grz) JPG image of residual (data minus model) image. 
P NAME-image- ri se) is. Inverse-variance weighted infrared image stack. 
IP _NAME-invvar- (w se)-fits.f2 Infrared inverse variance image stack. 

GROUP. NAME-argegalaxy-model- [i se) fits Infrared Tractor model image coadd, 
P NAME-image- WIW2jpg Color (W1W2) JPG image of the image stack 


NAME -model-WIW2 jpe Color (W1W2) IPG image of The 7: 
_NAME-largegalay-sample fs Catalog of one or more galaxies from the parent sample in this group. 
NAME largegalaxy-SGA_1D-llips fits” Detailed elipse-iting results foreach galaxy in this group: see Table 5 for the data model. 
(P _NAME-coadds-log Log output for the coadds stage of the SGA pipeline. 
 NAM-ellipse log Log output for the #1 Lip: of the SGA pipeline. 


Notes. Summary of files generated for each galaxy group in the SGA-2020. In this table, (region) denotes either north for BASS/MzLS or south for DECALS: 
(a12) denotes the g-, r-, or z-band filter; and (yi se) W3-, or W4 bandpass, 

2 hups:/ www legocysurvey org /de9 /bitmasks 

* This file may be missing (for the galaxy of a given SGA. 2D) if ellipse fiting fails or is not carried out 


Table 5 
Ellipse-fiting Data Model 

(Column Name Unite Description 

SA ID ‘See Table 2 

GALAXY See Table 2 


RA degn See Table 2. 


DE degn See Table 2. 

P See Table 2. 

PAED degree See Table 2. 

BA_LEDA See Table 2 

25_LEDA sreminute See Table 2 

BANDS List of bandpasses fitted (here, always gr) 

REFEAND Reference band (here, always r) 

REFPIXSCALE arcsec pixe! Pixel scale in REFBAND. 

E Flag indicating ellipse-fing succes or failure 

FITGEOMETRY Flag indicating whether the ellipse geometry was allowed to vary with semimajar axis (here, always 
False). 

INPUT ELLIPSE Flag indicating whether ellipse parameters were passed from an external file (here, always Fa 

IARGESHIET Flag indicating that the light-weighted center (from e11 ipse moments) is different from The 


Tractor position by more than 10 pixels in either dimension, in which case we adopt The 


model position. 
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Table 5 
(Continued) 

Column Name Units Description 

RA xo [m RA, (2000) at pixel position X. 

DEC Y0 degree Decl. (12000) at pixel position Y0. 

xo pixel Light-weighted position along the x-axis (fom e11ipse moments). 

vo pixel Light-weighted position along the y-axis (from e11ipse moments). 

EPS EEllipticity, £ = 1 — b/a, where b/a is the semiminor to semimajor axis ratio BA in Table 1 

PA degree Galaxy position angle (astronomical convention, measured east of north): equivalent to PA in Table I. 

THETA degree Galaxy position angle (physics convention, measured north of west) given by 270-PA mod 180, 

MAJORAXIS pixel Light-weighted length of ihe semimajor axis (from e11 ipse moments). 

MAXSMA pixel ‘Maximum semimajor axis length used for the ellipse-fting and curve-of-growth measurements 
(typically taken to be 2 x MAJORAXIS). 

INTEGRNODE* Integration mode (here, always median). 

p o clipping threshold (here, always 3). 

NCLIP © clipping iterations (here, always 2) 

PEFBIZE, (g: arsec Mean FWHM of the point-spread function over the full mosaic (derived from the PSFS128 gr 
columns in The Tractor catalogs). 

PSFDEPTH_( AB mag Mean Sa point-source depth over the full mosaic (derived from the PSFDEPTH [gr columns in 
The Tractor catalogs). 

Mi TRANSMISSION [grz) Galactic transmission fraction (taken from the corresponding Tractor catalog at the central 
coordinates of the galaxy), 

REFBAND WIDTH pixel Width of the optical mosaics in REFEAND, 

REFBAND_HEIGHT pixel Height of the optical mosaics in REFBAND. 

2) SMA pixel Ellipse semimajor axis position. 


2)_INTENS 


(arz) X0 ERR 
(sr) vo 


(grz)-Y0 ERR 


RADIUS SB(sblevel) 
RADTUS Sh(sblevel) 
2). MAG Sh(sblevel) 


(2r) MAG, SB[sbl evel) 


monomaggiesarcsec 
nanomaggies asec * 


degree 

degree 
pixel 
pixel 
pixel 
pixel 


nanomaggies asec * 


nanomaggies 


Linear surface brightness at 
To uncertainty in (gr2)_INTENS. 
Ellipse ellipticity: ere, fixed at EPS. 

lc uncertainty in (grz) EPS. 

Ellipse position angle; here, fixed at PA. 

Le uncertainty in (gr2)_PA. 

Ellipse saxis pixel coordinate; here, fixed at X0. 
1o uncertainty in (grz) X9. 

Ellipse y-axis pixel coordinate; here, fixed at Y0. 
le uncertainty in (grz) 0. 

‘Thind-onler harmonic coefficient; not used. 

1o uncertainty in (gr2)_A3. 

FFourth-order harmonic coefficient; not used, 

Le uncertainty in (grz) A4 

Rms surface brightness along the elliptical path. 
Pixel standard deviation estimate. 

Fitting stop code. 

Number of data points used for the fi. 

Number of points rejected during the fi 
Number of fiting iterations. 

Do not use; see Appendix C. 

Do not use: see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use: see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use; see Appendix C. 

Do not use: see Appendix C. 


Notes. Ellipsefiting results and surface-brightness profiles for a single galaxy in the SGA-2020; specifically, this table documents the data model for ihe 


GROUP. NAME-argegalaxy-5C^. 1D-ellipse.fits fle listed in Table 4. In this table, 
23, 23.5, 24, 245, 25, 25.5, and 26 mag arcsec” ^ isophote. 

icphote.Ellipse.fit image method documentation. 
P See the photutils. isophote. Isophote and photutils.isophot. 


P See the photutils. 


ms. 


2) denotes the g-, 7» or -band flle, and (sb eve) represents the 


IscphoteList method documentation. 
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Appendix C 
Known Issues 


Not surprisingly, a catalog of the size and complexity of the 
SGA-2020 has imperfections, most of which were identified after 
the fitting was finalized. In this appendix, we document the 
currently known issues, all of which we intend to address in future 
versions of the SGA. For additional details and the most up-to- 
date documentation regarding these and other issues, we refer the 
interested reader to the following URL: hitps://github.com/ 
moustakas/SGA /issues?q=is%3Aopen-+is%3Aissue-+label% 
3A%22SGA-2020 + Issue%22. 

In terms of the data reduction, the most significant known 
issue impacting the SGA-2020 is the pattern-noise subtraction 
from the Mosaic-3 imaging, which distorts the z-band surface- 
brightness profiles and colors of galaxies in the (northern) 
BASS+MZLS portion of the footprint (see Section 3.2.1 for 
details). In addition, SGA users have reported that in some 
cases the WISE WI and W2 background signal has been 
oversubtracted. We hypothesize that this oversubtraction ari 
because the unWISE background is modeled as the median flux 
after subtracting all the point sources in a 1^ x 1 grid (Schlafly 
et al, 2019), which would potentially impact all galaxies in the 
sample larger than approximately I. 

‘Some issues also impact the availability and accuracy of the 
optical photometry in the SGA-2020. The most significant 
problem is that the elliptical aperture photometry (and the 
corresponding curves of growth) reported in the individual 
GROUP_NAME-largegalaxy-SGA_1D-ellipse.fits files (see Table 
4) were impacted by a catastrophic bug, which rendered these 
measurements unusable. The data model for this file is 
documented in Table 5, and the impacted columns are flagged 
with the text "Do not use; see Appendix C." Fortunately, 
however, we were able to recover the curves of growth from 
the surface-brightness profiles themselves, which were not 
affected by this bug; we record those measurements in the 
ELLIPSE HDU of the merged SGA-2020. fits file (as 
documented in Table 2). 

Moreover, ellipse fitting was skipped, failed, or rejected for a 
few different reasons, which are encoded in the ELLIPSEBIT 
column; this column appears in Table 2 and is documented in 
Table 3. First, a total of 8415 galaxies were not ellipse-fit 
because they were deemed to be too small 
(ELLIPSEBIT = 2! or ELLIPSEBIT = 2°) via their best-fitting 

‘ractor model and halflight radius (shape_r); specifi- 
cally, the objects modeled by The Tractor as type=REX 
with shaper<2" or type=(EXP,DEV,SER} with 
shape. r < 5" were not ellipse-fit.'? Second, ellipse fitting did 
not complete (ELLIPSEBIT = 2") for 27 galaxies in the 
following seven groups: IC 1613, NGCOOSS Group, 
NGC0253 Group, NGC0300 Group, NGCOS98 Group. 
NGC 3031 Group, and NGC 5457. The central galaxies in 
these groups rank among the largest in the sample 
(18' < GROUP. DIAMETER < 62^), so modeling these systems 
posed some especially acute computational challenges. Third, 
the ellipse-fitting results for 52 galaxies were rejected 
(ELLIPSEBIT =25) because they were found via visual 
inspection to be incorrect or unreliable, usually due to 
incomplete or imperfect masking of nearby bright stars or 


‘Factor type and shape_r measurements are fully documented at 
hups:/ /www.legacysurvey org /de9/catalogs/fregion-tractor-aaa-tractor- 
bricks, 
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other galaxies. Finally, ellipse fitting was not carried out (or 
silently failed) on 6161 galaxies where the pipeline did not 
indicate a problem, and therefore, ELLTPSEBIT=0 for these 
systems even though there are no measured surface-brightness 
profiles 

In no particular order, some of the additional currently 
known issues include the following: 


1. A small number of objects without imaging in all three 

grz bandpasses made it into final sample, despite the 

requirement of three-band optical imaging discussed in 

Section 3.2."" 

A small fraction of objects have compromised surface- 

brightness profiles due to their proximity to bright stars'^ 

or because of poor deblending of multiple sources 

(typically due to incompleteness in the parent HyperLeda 

catalog). ^ 

3. The central coordinates for a small number of the largest. 
(51) galaxies in the SGA-2020 are incorrect by up to a 
few arcseconds, as discussed in Section 3.3. 

4. In a handful of spheroidal galaxies with large ellipticity 
(>05 or b/a<0.5), our masking algorithm (see 
Section 3.2.2) is too aggressive and inadvertently masks 
some of the outer-envelope light of the galaxy along the 
minor axis. 
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